Dataset statistics
| Number of variables | 33 |
|---|---|
| Number of observations | 199522 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 3172 |
| Duplicate rows (%) | 1.6% |
| Total size in memory | 50.2 MiB |
| Average record size in memory | 264.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 22 |
| Text | 1 |
| Dataset has 3172 (1.6%) duplicate rows | Duplicates |
edu_inst is highly imbalanced (74.6%) | Imbalance |
mace is highly imbalanced (62.2%) | Imbalance |
hispanic is highly imbalanced (71.7%) | Imbalance |
labor_union is highly imbalanced (67.5%) | Imbalance |
reason_unemployment is highly imbalanced (89.9%) | Imbalance |
migration_msa is highly imbalanced (55.1%) | Imbalance |
migration_reg is highly imbalanced (52.5%) | Imbalance |
migration_within is highly imbalanced (54.5%) | Imbalance |
citizen is highly imbalanced (70.8%) | Imbalance |
person_income is highly imbalanced (68.0%) | Imbalance |
own_bus is highly imbalanced (94.5%) | Imbalance |
income is highly imbalanced (66.4%) | Imbalance |
divdends is highly skewed (γ1 = 27.78643274) | Skewed |
age has 2839 (1.4%) zeros | Zeros |
industry_code has 100683 (50.5%) zeros | Zeros |
occupation_code has 100683 (50.5%) zeros | Zeros |
wage_per_hour has 188218 (94.3%) zeros | Zeros |
gains has 192143 (96.3%) zeros | Zeros |
losses has 195616 (98.0%) zeros | Zeros |
divdends has 178381 (89.4%) zeros | Zeros |
person_worked has 95982 (48.1%) zeros | Zeros |
week_workd has 95982 (48.1%) zeros | Zeros |
Reproduction
| Analysis started | 2024-05-18 10:35:55.169077 |
|---|---|
| Analysis finished | 2024-05-18 10:36:05.279845 |
| Duration | 10.11 seconds |
| Software version | ydata-profiling v4.8.3 |
| Download configuration | config.json |
age
Real number (ℝ)
ZEROS 
| Distinct | 91 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.494006 |
| Minimum | 0 |
|---|---|
| Maximum | 90 |
| Zeros | 2839 |
| Zeros (%) | 1.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 15 |
| median | 33 |
| Q3 | 50 |
| 95-th percentile | 75 |
| Maximum | 90 |
| Range | 90 |
| Interquartile range (IQR) | 35 |
Descriptive statistics
| Standard deviation | 22.310785 |
|---|---|
| Coefficient of variation (CV) | 0.64680179 |
| Kurtosis | -0.73279952 |
| Mean | 34.494006 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 0.37329807 |
| Sum | 6882313 |
| Variance | 497.77111 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34 | 3489 | 1.7% |
| 35 | 3450 | 1.7% |
| 36 | 3353 | 1.7% |
| 31 | 3351 | 1.7% |
| 33 | 3340 | 1.7% |
| 5 | 3332 | 1.7% |
| 4 | 3318 | 1.7% |
| 3 | 3279 | 1.6% |
| 37 | 3278 | 1.6% |
| 38 | 3277 | 1.6% |
| Other values (81) | 166055 |
| Value | Count | Frequency (%) |
| 0 | 2839 | |
| 1 | 3138 | |
| 2 | 3236 | |
| 3 | 3279 | |
| 4 | 3318 | |
| 5 | 3332 | |
| 6 | 3171 | |
| 7 | 3218 | |
| 8 | 3187 | |
| 9 | 3162 |
| Value | Count | Frequency (%) |
| 90 | 725 | |
| 89 | 195 | 0.1% |
| 88 | 241 | 0.1% |
| 87 | 301 | |
| 86 | 348 | |
| 85 | 423 | |
| 84 | 519 | |
| 83 | 561 | |
| 82 | 615 | |
| 81 | 720 |
class_of_worker
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| Private | |
| Self-employed-not incorporated | 8445 |
| Local government | 7784 |
| State government | 4227 |
| Other values (4) | 6794 |
Length
| Max length | 31 |
|---|---|
| Median length | 16 |
| Mean length | 14.021146 |
| Min length | 8 |
Characters and Unicode
| Total characters | 2797527 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Self-employed-not incorporated |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Private |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 100244 | |
| Private | 72028 | |
| Self-employed-not incorporated | 8445 | 4.2% |
| Local government | 7784 | 3.9% |
| State government | 4227 | 2.1% |
| Self-employed-incorporated | 3265 | 1.6% |
| Federal government | 2925 | 1.5% |
| Never worked | 439 | 0.2% |
| Without pay | 165 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 100244 | |
| in | 100244 | |
| universe | 100244 | |
| private | 72028 | |
| government | 14936 | 3.5% |
| self-employed-not | 8445 | 2.0% |
| incorporated | 8445 | 2.0% |
| local | 7784 | 1.8% |
| state | 4227 | 1.0% |
| self-employed-incorporated | 3265 | 0.8% |
| Other values (5) | 4133 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 423995 | ||
| e | 360622 | |
| i | 284391 | |
| n | 250515 | |
| t | 216147 | |
| r | 214431 | |
| v | 187647 | 6.7% |
| o | 167143 | 6.0% |
| N | 100683 | 3.6% |
| u | 100409 | 3.6% |
| Other values (19) | 491544 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2150590 | |
| Space Separator | 423995 | 15.2% |
| Uppercase Letter | 199522 | 7.1% |
| Dash Punctuation | 23420 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 360622 | |
| i | 284391 | |
| n | 250515 | |
| t | 216147 | |
| r | 214431 | |
| v | 187647 | |
| o | 167143 | |
| u | 100409 | 4.7% |
| s | 100244 | 4.7% |
| a | 98839 | 4.6% |
| Other values (11) | 170202 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 100683 | |
| P | 72028 | |
| S | 15937 | 8.0% |
| L | 7784 | 3.9% |
| F | 2925 | 1.5% |
| W | 165 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 423995 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 23420 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2350112 | |
| Common | 447415 | 16.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 360622 | |
| i | 284391 | |
| n | 250515 | |
| t | 216147 | |
| r | 214431 | |
| v | 187647 | |
| o | 167143 | |
| N | 100683 | 4.3% |
| u | 100409 | 4.3% |
| s | 100244 | 4.3% |
| Other values (17) | 367880 |
Common
| Value | Count | Frequency (%) |
| 423995 | ||
| - | 23420 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2797527 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 423995 | ||
| e | 360622 | |
| i | 284391 | |
| n | 250515 | |
| t | 216147 | |
| r | 214431 | |
| v | 187647 | 6.7% |
| o | 167143 | 6.0% |
| N | 100683 | 3.6% |
| u | 100409 | 3.6% |
| Other values (19) | 491544 |
industry_code
Real number (ℝ)
ZEROS 
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.352397 |
| Minimum | 0 |
|---|---|
| Maximum | 51 |
| Zeros | 100683 |
| Zeros (%) | 50.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 33 |
| 95-th percentile | 44 |
| Maximum | 51 |
| Range | 51 |
| Interquartile range (IQR) | 33 |
Descriptive statistics
| Standard deviation | 18.067141 |
|---|---|
| Coefficient of variation (CV) | 1.1768287 |
| Kurtosis | -1.501116 |
| Mean | 15.352397 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.51667949 |
| Sum | 3063141 |
| Variance | 326.4216 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 100683 | |
| 33 | 17070 | 8.6% |
| 43 | 8283 | 4.2% |
| 4 | 5984 | 3.0% |
| 42 | 4683 | 2.3% |
| 45 | 4482 | 2.2% |
| 29 | 4209 | 2.1% |
| 37 | 4022 | 2.0% |
| 41 | 3964 | 2.0% |
| 32 | 3596 | 1.8% |
| Other values (42) | 42546 |
| Value | Count | Frequency (%) |
| 0 | 100683 | |
| 1 | 827 | 0.4% |
| 2 | 2196 | 1.1% |
| 3 | 563 | 0.3% |
| 4 | 5984 | 3.0% |
| 5 | 553 | 0.3% |
| 6 | 554 | 0.3% |
| 7 | 422 | 0.2% |
| 8 | 550 | 0.3% |
| 9 | 993 | 0.5% |
| Value | Count | Frequency (%) |
| 51 | 36 | < 0.1% |
| 50 | 1704 | 0.9% |
| 49 | 610 | 0.3% |
| 48 | 652 | 0.3% |
| 47 | 1644 | 0.8% |
| 46 | 187 | 0.1% |
| 45 | 4482 | |
| 44 | 2549 | 1.3% |
| 43 | 8283 | |
| 42 | 4683 |
occupation_code
Real number (ℝ)
ZEROS 
| Distinct | 47 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.306613 |
| Minimum | 0 |
|---|---|
| Maximum | 46 |
| Zeros | 100683 |
| Zeros (%) | 50.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 26 |
| 95-th percentile | 38 |
| Maximum | 46 |
| Range | 46 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 14.454218 |
|---|---|
| Coefficient of variation (CV) | 1.2783862 |
| Kurtosis | -0.89654589 |
| Mean | 11.306613 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.82923051 |
| Sum | 2255918 |
| Variance | 208.92442 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 100683 | |
| 2 | 8756 | 4.4% |
| 26 | 7887 | 4.0% |
| 19 | 5413 | 2.7% |
| 29 | 5105 | 2.6% |
| 36 | 4145 | 2.1% |
| 34 | 4025 | 2.0% |
| 10 | 3683 | 1.8% |
| 16 | 3445 | 1.7% |
| 23 | 3392 | 1.7% |
| Other values (37) | 52988 |
| Value | Count | Frequency (%) |
| 0 | 100683 | |
| 1 | 544 | 0.3% |
| 2 | 8756 | 4.4% |
| 3 | 3195 | 1.6% |
| 4 | 1364 | 0.7% |
| 5 | 855 | 0.4% |
| 6 | 441 | 0.2% |
| 7 | 731 | 0.4% |
| 8 | 2151 | 1.1% |
| 9 | 738 | 0.4% |
| Value | Count | Frequency (%) |
| 46 | 36 | < 0.1% |
| 45 | 172 | 0.1% |
| 44 | 1592 | |
| 43 | 1382 | |
| 42 | 1918 | |
| 41 | 1592 | |
| 40 | 617 | 0.3% |
| 39 | 1017 | 0.5% |
| 38 | 3003 | |
| 37 | 2234 |
education
Categorical
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| High school graduate | |
|---|---|
| Children | |
| Some college but no degree | |
| Bachelors degree(BA AB BS) | |
| 7th and 8th grade | |
| Other values (12) |
Length
| Max length | 39 |
|---|---|
| Median length | 35 |
| Mean length | 19.86398 |
| Min length | 9 |
Characters and Unicode
| Total characters | 3963301 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Some college but no degree |
|---|---|
| 2nd row | 10th grade |
| 3rd row | Children |
| 4th row | Children |
| 5th row | Some college but no degree |
Common Values
| Value | Count | Frequency (%) |
| High school graduate | 48406 | |
| Children | 47422 | |
| Some college but no degree | 27820 | |
| Bachelors degree(BA AB BS) | 19865 | |
| 7th and 8th grade | 8007 | 4.0% |
| 10th grade | 7557 | 3.8% |
| 11th grade | 6876 | 3.4% |
| Masters degree(MA MS MEng MEd MSW MBA) | 6541 | 3.3% |
| 9th grade | 6230 | 3.1% |
| Associates degree-occup /vocational | 5358 | 2.7% |
| Other values (7) | 15440 | 7.7% |
Length
| Value | Count | Frequency (%) |
| school | 50199 | 8.2% |
| graduate | 48406 | 7.9% |
| high | 48406 | 7.9% |
| children | 47422 | 7.7% |
| grade | 36691 | 6.0% |
| no | 29946 | 4.9% |
| degree | 29613 | 4.8% |
| some | 27820 | 4.5% |
| college | 27820 | 4.5% |
| but | 27820 | 4.5% |
| Other values (42) | 239176 |
Most occurring characters
| Value | Count | Frequency (%) |
| 613319 | ||
| e | 459560 | 11.6% |
| o | 247528 | 6.2% |
| r | 244585 | 6.2% |
| g | 239230 | 6.0% |
| d | 225420 | 5.7% |
| h | 215130 | 5.4% |
| a | 205650 | 5.2% |
| l | 180610 | 4.6% |
| t | 150965 | 3.8% |
| Other values (37) | 1181304 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2803273 | |
| Space Separator | 613319 | 15.5% |
| Uppercase Letter | 402775 | 10.2% |
| Decimal Number | 69931 | 1.8% |
| Close Punctuation | 29462 | 0.7% |
| Open Punctuation | 29462 | 0.7% |
| Dash Punctuation | 9721 | 0.2% |
| Other Punctuation | 5358 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 459560 | |
| o | 247528 | |
| r | 244585 | |
| g | 239230 | |
| d | 225420 | |
| h | 215130 | |
| a | 205650 | |
| l | 180610 | 6.4% |
| t | 150965 | 5.4% |
| c | 133668 | 4.8% |
| Other values (9) | 500927 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 87794 | |
| S | 62560 | |
| A | 62533 | |
| M | 49373 | |
| H | 48406 | |
| C | 47422 | |
| E | 14345 | 3.6% |
| D | 12754 | 3.2% |
| W | 6541 | 1.6% |
| L | 4405 | 1.1% |
| Other values (3) | 6642 | 1.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 26053 | |
| 7 | 8007 | 11.4% |
| 8 | 8007 | 11.4% |
| 0 | 7557 | 10.8% |
| 9 | 6230 | 8.9% |
| 2 | 3925 | 5.6% |
| 5 | 3277 | 4.7% |
| 6 | 3277 | 4.7% |
| 3 | 1799 | 2.6% |
| 4 | 1799 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 613319 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 29462 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 29462 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9721 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 5358 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3206048 | |
| Common | 757253 | 19.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 459560 | |
| o | 247528 | 7.7% |
| r | 244585 | 7.6% |
| g | 239230 | 7.5% |
| d | 225420 | 7.0% |
| h | 215130 | 6.7% |
| a | 205650 | 6.4% |
| l | 180610 | 5.6% |
| t | 150965 | 4.7% |
| c | 133668 | 4.2% |
| Other values (22) | 903702 |
Common
| Value | Count | Frequency (%) |
| 613319 | ||
| ) | 29462 | 3.9% |
| ( | 29462 | 3.9% |
| 1 | 26053 | 3.4% |
| - | 9721 | 1.3% |
| 7 | 8007 | 1.1% |
| 8 | 8007 | 1.1% |
| 0 | 7557 | 1.0% |
| 9 | 6230 | 0.8% |
| / | 5358 | 0.7% |
| Other values (5) | 14077 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3963301 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 613319 | ||
| e | 459560 | 11.6% |
| o | 247528 | 6.2% |
| r | 244585 | 6.2% |
| g | 239230 | 6.0% |
| d | 225420 | 5.7% |
| h | 215130 | 5.4% |
| a | 205650 | 5.2% |
| l | 180610 | 4.6% |
| t | 150965 | 3.8% |
| Other values (37) | 1181304 |
wage_per_hour
Real number (ℝ)
ZEROS 
| Distinct | 1240 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 55.427186 |
| Minimum | 0 |
|---|---|
| Maximum | 9999 |
| Zeros | 188218 |
| Zeros (%) | 94.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 495 |
| Maximum | 9999 |
| Range | 9999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 274.89711 |
|---|---|
| Coefficient of variation (CV) | 4.959608 |
| Kurtosis | 155.21813 |
| Mean | 55.427186 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.9350739 |
| Sum | 11058943 |
| Variance | 75568.424 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 188218 | |
| 500 | 734 | 0.4% |
| 600 | 546 | 0.3% |
| 700 | 534 | 0.3% |
| 800 | 507 | 0.3% |
| 1000 | 386 | 0.2% |
| 425 | 376 | 0.2% |
| 900 | 336 | 0.2% |
| 550 | 280 | 0.1% |
| 1200 | 256 | 0.1% |
| Other values (1230) | 7349 | 3.7% |
| Value | Count | Frequency (%) |
| 0 | 188218 | |
| 20 | 1 | < 0.1% |
| 70 | 1 | < 0.1% |
| 75 | 2 | < 0.1% |
| 100 | 11 | < 0.1% |
| 110 | 1 | < 0.1% |
| 125 | 1 | < 0.1% |
| 135 | 1 | < 0.1% |
| 143 | 1 | < 0.1% |
| 150 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 9999 | 1 | < 0.1% |
| 9916 | 1 | < 0.1% |
| 9800 | 2 | |
| 9400 | 2 | |
| 9000 | 1 | < 0.1% |
| 8800 | 1 | < 0.1% |
| 8600 | 1 | < 0.1% |
| 8500 | 1 | < 0.1% |
| 8300 | 1 | < 0.1% |
| 8000 | 4 |
edu_inst
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| High school | 6892 |
| College or university | 5688 |
Length
| Max length | 22 |
|---|---|
| Median length | 16 |
| Mean length | 16.032879 |
| Min length | 12 |
Characters and Unicode
| Total characters | 3198912 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | High school |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 186942 | |
| High school | 6892 | 3.5% |
| College or university | 5688 | 2.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 186942 | |
| in | 186942 | |
| universe | 186942 | |
| high | 6892 | 1.2% |
| school | 6892 | 1.2% |
| college | 5688 | 1.0% |
| or | 5688 | 1.0% |
| university | 5688 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 591674 | ||
| i | 392152 | |
| e | 390948 | |
| n | 379572 | |
| o | 212102 | 6.6% |
| s | 199522 | 6.2% |
| r | 198318 | 6.2% |
| v | 192630 | 6.0% |
| u | 192630 | 6.0% |
| t | 192630 | 6.0% |
| Other values (8) | 256734 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2407716 | |
| Space Separator | 591674 | 18.5% |
| Uppercase Letter | 199522 | 6.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 392152 | |
| e | 390948 | |
| n | 379572 | |
| o | 212102 | |
| s | 199522 | |
| r | 198318 | |
| v | 192630 | |
| u | 192630 | |
| t | 192630 | |
| l | 18268 | 0.8% |
| Other values (4) | 38944 | 1.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 186942 | |
| H | 6892 | 3.5% |
| C | 5688 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| 591674 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2607238 | |
| Common | 591674 | 18.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 392152 | |
| e | 390948 | |
| n | 379572 | |
| o | 212102 | |
| s | 199522 | |
| r | 198318 | |
| v | 192630 | |
| u | 192630 | |
| t | 192630 | |
| N | 186942 | |
| Other values (7) | 69792 | 2.7% |
Common
| Value | Count | Frequency (%) |
| 591674 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3198912 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 591674 | ||
| i | 392152 | |
| e | 390948 | |
| n | 379572 | |
| o | 212102 | 6.6% |
| s | 199522 | 6.2% |
| r | 198318 | 6.2% |
| v | 192630 | 6.0% |
| u | 192630 | 6.0% |
| t | 192630 | 6.0% |
| Other values (8) | 256734 |
marital
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Never married | |
|---|---|
| Married-civilian spouse present | |
| Divorced | |
| Widowed | |
| Separated | 3460 |
| Other values (2) | 2183 |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 20.999845 |
| Min length | 8 |
Characters and Unicode
| Total characters | 4189931 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Divorced |
|---|---|
| 2nd row | Never married |
| 3rd row | Never married |
| 4th row | Never married |
| 5th row | Married-civilian spouse present |
Common Values
| Value | Count | Frequency (%) |
| Never married | 86485 | |
| Married-civilian spouse present | 84222 | |
| Divorced | 12710 | 6.4% |
| Widowed | 10462 | 5.2% |
| Separated | 3460 | 1.7% |
| Married-spouse absent | 1518 | 0.8% |
| Married-A F spouse present | 665 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| never | 86485 | |
| married | 86485 | |
| spouse | 84887 | |
| present | 84887 | |
| married-civilian | 84222 | |
| divorced | 12710 | 2.8% |
| widowed | 10462 | 2.3% |
| separated | 3460 | 0.8% |
| married-spouse | 1518 | 0.3% |
| absent | 1518 | 0.3% |
| Other values (2) | 1330 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 633649 | |
| r | 533322 | |
| 457964 | ||
| i | 448728 | |
| a | 265550 | 6.3% |
| s | 259215 | 6.2% |
| d | 209984 | 5.0% |
| v | 183417 | 4.4% |
| p | 174752 | 4.2% |
| n | 170627 | 4.1% |
| Other values (16) | 852723 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3444710 | |
| Space Separator | 457964 | 10.9% |
| Uppercase Letter | 200852 | 4.8% |
| Dash Punctuation | 86405 | 2.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 633649 | |
| r | 533322 | |
| i | 448728 | |
| a | 265550 | |
| s | 259215 | |
| d | 209984 | 6.1% |
| v | 183417 | 5.3% |
| p | 174752 | 5.1% |
| n | 170627 | 5.0% |
| o | 109577 | 3.2% |
| Other values (7) | 455889 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 86485 | |
| M | 86405 | |
| D | 12710 | 6.3% |
| W | 10462 | 5.2% |
| S | 3460 | 1.7% |
| A | 665 | 0.3% |
| F | 665 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 457964 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 86405 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3645562 | |
| Common | 544369 | 13.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 633649 | |
| r | 533322 | |
| i | 448728 | |
| a | 265550 | |
| s | 259215 | 7.1% |
| d | 209984 | 5.8% |
| v | 183417 | 5.0% |
| p | 174752 | 4.8% |
| n | 170627 | 4.7% |
| o | 109577 | 3.0% |
| Other values (14) | 656741 |
Common
| Value | Count | Frequency (%) |
| 457964 | ||
| - | 86405 | 15.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4189931 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 633649 | |
| r | 533322 | |
| 457964 | ||
| i | 448728 | |
| a | 265550 | 6.3% |
| s | 259215 | 6.2% |
| d | 209984 | 5.0% |
| v | 183417 | 4.4% |
| p | 174752 | 4.2% |
| n | 170627 | 4.1% |
| Other values (16) | 852723 |
mace
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| White | |
|---|---|
| Black | |
| Asian or Pacific Islander | 5835 |
| Other | 3657 |
| Amer Indian Aleut or Eskimo | 2251 |
Length
| Max length | 28 |
|---|---|
| Median length | 6 |
| Mean length | 6.8331011 |
| Min length | 6 |
Characters and Unicode
| Total characters | 1363354 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | White |
|---|---|
| 2nd row | Asian or Pacific Islander |
| 3rd row | White |
| 4th row | White |
| 5th row | Amer Indian Aleut or Eskimo |
Common Values
| Value | Count | Frequency (%) |
| White | 167364 | |
| Black | 20415 | 10.2% |
| Asian or Pacific Islander | 5835 | 2.9% |
| Other | 3657 | 1.8% |
| Amer Indian Aleut or Eskimo | 2251 | 1.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| white | 167364 | |
| black | 20415 | 9.0% |
| or | 8086 | 3.6% |
| asian | 5835 | 2.6% |
| pacific | 5835 | 2.6% |
| islander | 5835 | 2.6% |
| other | 3657 | 1.6% |
| amer | 2251 | 1.0% |
| indian | 2251 | 1.0% |
| aleut | 2251 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 226031 | ||
| i | 189371 | |
| e | 181358 | |
| t | 173272 | |
| h | 171021 | |
| W | 167364 | |
| a | 40171 | 2.9% |
| c | 32085 | 2.4% |
| l | 28501 | 2.1% |
| k | 22666 | 1.7% |
| Other values (14) | 131514 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 919378 | |
| Space Separator | 226031 | 16.6% |
| Uppercase Letter | 217945 | 16.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 189371 | |
| e | 181358 | |
| t | 173272 | |
| h | 171021 | |
| a | 40171 | 4.4% |
| c | 32085 | 3.5% |
| l | 28501 | 3.1% |
| k | 22666 | 2.5% |
| r | 19829 | 2.2% |
| n | 16172 | 1.8% |
| Other values (6) | 44932 | 4.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 167364 | |
| B | 20415 | 9.4% |
| A | 10337 | 4.7% |
| I | 8086 | 3.7% |
| P | 5835 | 2.7% |
| O | 3657 | 1.7% |
| E | 2251 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 226031 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1137323 | |
| Common | 226031 | 16.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 189371 | |
| e | 181358 | |
| t | 173272 | |
| h | 171021 | |
| W | 167364 | |
| a | 40171 | 3.5% |
| c | 32085 | 2.8% |
| l | 28501 | 2.5% |
| k | 22666 | 2.0% |
| B | 20415 | 1.8% |
| Other values (13) | 111099 |
Common
| Value | Count | Frequency (%) |
| 226031 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1363354 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 226031 | ||
| i | 189371 | |
| e | 181358 | |
| t | 173272 | |
| h | 171021 | |
| W | 167364 | |
| a | 40171 | 2.9% |
| c | 32085 | 2.4% |
| l | 28501 | 2.1% |
| k | 22666 | 1.7% |
| Other values (14) | 131514 |
hispanic
Categorical
IMBALANCE 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| All other | |
|---|---|
| Mexican-American | 8079 |
| Mexican (Mexicano) | 7234 |
| Central or South American | 3895 |
| Puerto Rican | 3313 |
| Other values (5) | 5095 |
Length
| Max length | 26 |
|---|---|
| Median length | 10 |
| Mean length | 10.968515 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2188460 |
|---|---|
| Distinct characters | 31 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | All other |
|---|---|
| 2nd row | All other |
| 3rd row | All other |
| 4th row | All other |
| 5th row | All other |
Common Values
| Value | Count | Frequency (%) |
| All other | 171906 | |
| Mexican-American | 8079 | 4.0% |
| Mexican (Mexicano) | 7234 | 3.6% |
| Central or South American | 3895 | 2.0% |
| Puerto Rican | 3313 | 1.7% |
| Other Spanish | 2485 | 1.2% |
| Cuban | 1126 | 0.6% |
| NA | 874 | 0.4% |
| Do not know | 306 | 0.2% |
| Chicano | 304 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| other | 174391 | |
| all | 171906 | |
| mexican-american | 8079 | 2.0% |
| mexican | 7234 | 1.8% |
| mexicano | 7234 | 1.8% |
| central | 3895 | 1.0% |
| or | 3895 | 1.0% |
| south | 3895 | 1.0% |
| american | 3895 | 1.0% |
| rican | 3313 | 0.8% |
| Other values (8) | 9020 | 2.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 396757 | ||
| l | 347707 | |
| e | 216120 | |
| r | 197468 | |
| o | 191465 | |
| t | 185800 | |
| A | 184754 | |
| h | 181075 | |
| n | 46256 | 2.1% |
| a | 45644 | 2.1% |
| Other values (21) | 195414 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1539859 | |
| Space Separator | 396757 | 18.1% |
| Uppercase Letter | 229297 | 10.5% |
| Dash Punctuation | 8079 | 0.4% |
| Open Punctuation | 7234 | 0.3% |
| Close Punctuation | 7234 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 347707 | |
| e | 216120 | |
| r | 197468 | |
| o | 191465 | |
| t | 185800 | |
| h | 181075 | |
| n | 46256 | 3.0% |
| a | 45644 | 3.0% |
| i | 40623 | 2.6% |
| c | 38138 | 2.5% |
| Other values (8) | 49563 | 3.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 184754 | |
| M | 22547 | 9.8% |
| S | 6380 | 2.8% |
| C | 5325 | 2.3% |
| P | 3313 | 1.4% |
| R | 3313 | 1.4% |
| O | 2485 | 1.1% |
| N | 874 | 0.4% |
| D | 306 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 396757 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8079 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 7234 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 7234 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1769156 | |
| Common | 419304 | 19.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 347707 | |
| e | 216120 | |
| r | 197468 | |
| o | 191465 | |
| t | 185800 | |
| A | 184754 | |
| h | 181075 | |
| n | 46256 | 2.6% |
| a | 45644 | 2.6% |
| i | 40623 | 2.3% |
| Other values (17) | 132244 | 7.5% |
Common
| Value | Count | Frequency (%) |
| 396757 | ||
| - | 8079 | 1.9% |
| ( | 7234 | 1.7% |
| ) | 7234 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2188460 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 396757 | ||
| l | 347707 | |
| e | 216120 | |
| r | 197468 | |
| o | 191465 | |
| t | 185800 | |
| A | 184754 | |
| h | 181075 | |
| n | 46256 | 2.1% |
| a | 45644 | 2.1% |
| Other values (21) | 195414 |
sex
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Female | |
|---|---|
| Male |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.0423211 |
| Min length | 5 |
Characters and Unicode
| Total characters | 1205576 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Male |
|---|---|
| 2nd row | Female |
| 3rd row | Female |
| 4th row | Female |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
| Female | 103983 | |
| Male | 95539 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| female | 103983 | |
| male | 95539 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 303505 | |
| 199522 | ||
| a | 199522 | |
| l | 199522 | |
| F | 103983 | 8.6% |
| m | 103983 | 8.6% |
| M | 95539 | 7.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 806532 | |
| Space Separator | 199522 | 16.5% |
| Uppercase Letter | 199522 | 16.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 303505 | |
| a | 199522 | |
| l | 199522 | |
| m | 103983 | 12.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 103983 | |
| M | 95539 |
Space Separator
| Value | Count | Frequency (%) |
| 199522 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1006054 | |
| Common | 199522 | 16.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 303505 | |
| a | 199522 | |
| l | 199522 | |
| F | 103983 | 10.3% |
| m | 103983 | 10.3% |
| M | 95539 | 9.5% |
Common
| Value | Count | Frequency (%) |
| 199522 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1205576 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 303505 | |
| 199522 | ||
| a | 199522 | |
| l | 199522 | |
| F | 103983 | 8.6% |
| m | 103983 | 8.6% |
| M | 95539 | 7.9% |
labor_union
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| No | 16034 |
| Yes | 3030 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 14.773058 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2947550 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | No |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 180458 | |
| No | 16034 | 8.0% |
| Yes | 3030 | 1.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 180458 | |
| in | 180458 | |
| universe | 180458 | |
| no | 16034 | 2.9% |
| yes | 3030 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 560438 | ||
| e | 363946 | |
| i | 360916 | |
| n | 360916 | |
| N | 196492 | 6.7% |
| o | 196492 | 6.7% |
| s | 183488 | 6.2% |
| t | 180458 | 6.1% |
| u | 180458 | 6.1% |
| v | 180458 | 6.1% |
| Other values (2) | 183488 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2187590 | |
| Space Separator | 560438 | 19.0% |
| Uppercase Letter | 199522 | 6.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 363946 | |
| i | 360916 | |
| n | 360916 | |
| o | 196492 | |
| s | 183488 | |
| t | 180458 | |
| u | 180458 | |
| v | 180458 | |
| r | 180458 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 196492 | |
| Y | 3030 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| 560438 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2387112 | |
| Common | 560438 | 19.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 363946 | |
| i | 360916 | |
| n | 360916 | |
| N | 196492 | |
| o | 196492 | |
| s | 183488 | |
| t | 180458 | |
| u | 180458 | |
| v | 180458 | |
| r | 180458 |
Common
| Value | Count | Frequency (%) |
| 560438 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2947550 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 560438 | ||
| e | 363946 | |
| i | 360916 | |
| n | 360916 | |
| N | 196492 | 6.7% |
| o | 196492 | 6.7% |
| s | 183488 | 6.2% |
| t | 180458 | 6.1% |
| u | 180458 | 6.1% |
| v | 180458 | 6.1% |
| Other values (2) | 183488 | 6.2% |
reason_unemployment
Categorical
IMBALANCE 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| Other job loser | 2038 |
| Re-entrant | 2019 |
| Job loser - on layoff | 976 |
| Job leaver | 598 |
Length
| Max length | 22 |
|---|---|
| Median length | 16 |
| Mean length | 15.954967 |
| Min length | 11 |
Characters and Unicode
| Total characters | 3183367 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 193452 | |
| Other job loser | 2038 | 1.0% |
| Re-entrant | 2019 | 1.0% |
| Job loser - on layoff | 976 | 0.5% |
| Job leaver | 598 | 0.3% |
| New entrant | 439 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 193452 | |
| in | 193452 | |
| universe | 193452 | |
| job | 3612 | 0.6% |
| loser | 3014 | 0.5% |
| other | 2038 | 0.3% |
| re-entrant | 2019 | 0.3% |
| 976 | 0.2% | |
| on | 976 | 0.2% |
| layoff | 976 | 0.2% |
| Other values (3) | 1476 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 595443 | ||
| e | 398068 | |
| n | 392796 | |
| i | 386904 | |
| o | 202030 | 6.3% |
| r | 201560 | 6.3% |
| t | 200406 | 6.3% |
| s | 196466 | 6.2% |
| v | 194050 | 6.1% |
| N | 193891 | 6.1% |
| Other values (13) | 221753 | 7.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2385407 | |
| Space Separator | 595443 | 18.7% |
| Uppercase Letter | 199522 | 6.3% |
| Dash Punctuation | 2995 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 398068 | |
| n | 392796 | |
| i | 386904 | |
| o | 202030 | |
| r | 201560 | |
| t | 200406 | |
| s | 196466 | |
| v | 194050 | |
| u | 193452 | |
| l | 4588 | 0.2% |
| Other values (7) | 15087 | 0.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 193891 | |
| O | 2038 | 1.0% |
| R | 2019 | 1.0% |
| J | 1574 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 595443 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2995 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2584929 | |
| Common | 598438 | 18.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 398068 | |
| n | 392796 | |
| i | 386904 | |
| o | 202030 | |
| r | 201560 | |
| t | 200406 | |
| s | 196466 | |
| v | 194050 | |
| N | 193891 | |
| u | 193452 | |
| Other values (11) | 25306 | 1.0% |
Common
| Value | Count | Frequency (%) |
| 595443 | ||
| - | 2995 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3183367 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 595443 | ||
| e | 398068 | |
| n | 392796 | |
| i | 386904 | |
| o | 202030 | 6.3% |
| r | 201560 | 6.3% |
| t | 200406 | 6.3% |
| s | 196466 | 6.2% |
| v | 194050 | 6.1% |
| N | 193891 | 6.1% |
| Other values (13) | 221753 | 7.0% |
employment_type
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Children or Armed Forces | |
|---|---|
| Full-time schedules | |
| Not in labor force | |
| PT for non-econ reasons usually FT | 3322 |
| Unemployed full-time | 2311 |
| Other values (3) | 2577 |
Length
| Max length | 35 |
|---|---|
| Median length | 25 |
| Mean length | 23.33266 |
| Min length | 19 |
Characters and Unicode
| Total characters | 4655379 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Children or Armed Forces |
|---|---|
| 2nd row | Not in labor force |
| 3rd row | Children or Armed Forces |
| 4th row | Children or Armed Forces |
| 5th row | Full-time schedules |
Common Values
| Value | Count | Frequency (%) |
| Children or Armed Forces | 123769 | |
| Full-time schedules | 40736 | 20.4% |
| Not in labor force | 26807 | 13.4% |
| PT for non-econ reasons usually FT | 3322 | 1.7% |
| Unemployed full-time | 2311 | 1.2% |
| PT for econ reasons usually PT | 1209 | 0.6% |
| Unemployed part- time | 843 | 0.4% |
| PT for econ reasons usually FT | 525 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| children | 123769 | |
| or | 123769 | |
| armed | 123769 | |
| forces | 123769 | |
| full-time | 43047 | 6.0% |
| schedules | 40736 | 5.6% |
| not | 26807 | 3.7% |
| in | 26807 | 3.7% |
| labor | 26807 | 3.7% |
| force | 26807 | 3.7% |
| Other values (10) | 35176 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 721263 | ||
| r | 559645 | |
| e | 539896 | |
| o | 349603 | 7.5% |
| d | 291428 | 6.3% |
| l | 290672 | 6.2% |
| s | 220409 | 4.7% |
| c | 196368 | 4.2% |
| i | 194466 | 4.2% |
| m | 170813 | 3.7% |
| Other values (17) | 1120816 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3424676 | |
| Space Separator | 721263 | 15.5% |
| Uppercase Letter | 462228 | 9.9% |
| Dash Punctuation | 47212 | 1.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 559645 | |
| e | 539896 | |
| o | 349603 | |
| d | 291428 | |
| l | 290672 | |
| s | 220409 | 6.4% |
| c | 196368 | 5.7% |
| i | 194466 | 5.7% |
| m | 170813 | 5.0% |
| n | 170486 | 5.0% |
| Other values (8) | 440890 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 168352 | |
| A | 123769 | |
| C | 123769 | |
| N | 26807 | 5.8% |
| T | 10112 | 2.2% |
| P | 6265 | 1.4% |
| U | 3154 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 721263 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 47212 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3886904 | |
| Common | 768475 | 16.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 559645 | |
| e | 539896 | |
| o | 349603 | 9.0% |
| d | 291428 | 7.5% |
| l | 290672 | 7.5% |
| s | 220409 | 5.7% |
| c | 196368 | 5.1% |
| i | 194466 | 5.0% |
| m | 170813 | 4.4% |
| n | 170486 | 4.4% |
| Other values (15) | 903118 |
Common
| Value | Count | Frequency (%) |
| 721263 | ||
| - | 47212 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4655379 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 721263 | ||
| r | 559645 | |
| e | 539896 | |
| o | 349603 | 7.5% |
| d | 291428 | 6.3% |
| l | 290672 | 6.2% |
| s | 220409 | 4.7% |
| c | 196368 | 4.2% |
| i | 194466 | 4.2% |
| m | 170813 | 3.7% |
| Other values (17) | 1120816 |
gains
Real number (ℝ)
ZEROS 
| Distinct | 132 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 434.72117 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 192143 |
| Zeros (%) | 96.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4697.543 |
|---|---|
| Coefficient of variation (CV) | 10.805876 |
| Kurtosis | 393.06085 |
| Mean | 434.72117 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 18.990775 |
| Sum | 86736437 |
| Variance | 22066910 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 192143 | |
| 15024 | 788 | 0.4% |
| 7688 | 609 | 0.3% |
| 7298 | 582 | 0.3% |
| 99999 | 390 | 0.2% |
| 3103 | 237 | 0.1% |
| 5178 | 207 | 0.1% |
| 5013 | 158 | 0.1% |
| 4386 | 151 | 0.1% |
| 3325 | 121 | 0.1% |
| Other values (122) | 4136 | 2.1% |
| Value | Count | Frequency (%) |
| 0 | 192143 | |
| 114 | 11 | < 0.1% |
| 401 | 33 | < 0.1% |
| 594 | 88 | < 0.1% |
| 914 | 17 | < 0.1% |
| 991 | 59 | < 0.1% |
| 1055 | 69 | < 0.1% |
| 1086 | 81 | < 0.1% |
| 1090 | 2 | < 0.1% |
| 1111 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 390 | |
| 41310 | 2 | < 0.1% |
| 34095 | 11 | < 0.1% |
| 27828 | 94 | < 0.1% |
| 25236 | 23 | < 0.1% |
| 25124 | 18 | < 0.1% |
| 22040 | 2 | < 0.1% |
| 20051 | 91 | < 0.1% |
| 18481 | 14 | < 0.1% |
| 15831 | 16 | < 0.1% |
losses
Real number (ℝ)
ZEROS 
| Distinct | 113 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.313975 |
| Minimum | 0 |
|---|---|
| Maximum | 4608 |
| Zeros | 195616 |
| Zeros (%) | 98.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 4608 |
| Range | 4608 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 271.8971 |
|---|---|
| Coefficient of variation (CV) | 7.2867362 |
| Kurtosis | 61.6326 |
| Mean | 37.313975 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 7.6325446 |
| Sum | 7444959 |
| Variance | 73928.031 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 195616 | |
| 1902 | 407 | 0.2% |
| 1977 | 381 | 0.2% |
| 1887 | 364 | 0.2% |
| 1602 | 193 | 0.1% |
| 2415 | 122 | 0.1% |
| 1485 | 95 | < 0.1% |
| 1848 | 88 | < 0.1% |
| 1876 | 87 | < 0.1% |
| 1672 | 85 | < 0.1% |
| Other values (103) | 2084 | 1.0% |
| Value | Count | Frequency (%) |
| 0 | 195616 | |
| 155 | 1 | < 0.1% |
| 213 | 10 | < 0.1% |
| 323 | 10 | < 0.1% |
| 419 | 29 | < 0.1% |
| 625 | 25 | < 0.1% |
| 653 | 7 | < 0.1% |
| 772 | 5 | < 0.1% |
| 810 | 5 | < 0.1% |
| 880 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 4608 | 4 | < 0.1% |
| 4356 | 30 | |
| 3900 | 2 | < 0.1% |
| 3770 | 5 | < 0.1% |
| 3683 | 4 | < 0.1% |
| 3500 | 10 | < 0.1% |
| 3175 | 8 | < 0.1% |
| 3004 | 11 | < 0.1% |
| 2824 | 27 | |
| 2788 | 7 | < 0.1% |
divdends
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 1478 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 197.53052 |
| Minimum | 0 |
|---|---|
| Maximum | 99999 |
| Zeros | 178381 |
| Zeros (%) | 89.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 400 |
| Maximum | 99999 |
| Range | 99999 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1984.1686 |
|---|---|
| Coefficient of variation (CV) | 10.044871 |
| Kurtosis | 1090.5583 |
| Mean | 197.53052 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 27.786433 |
| Sum | 39411685 |
| Variance | 3936925 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 178381 | |
| 100 | 1148 | 0.6% |
| 500 | 1030 | 0.5% |
| 1000 | 894 | 0.4% |
| 200 | 866 | 0.4% |
| 50 | 832 | 0.4% |
| 2000 | 574 | 0.3% |
| 250 | 555 | 0.3% |
| 150 | 549 | 0.3% |
| 300 | 523 | 0.3% |
| Other values (1468) | 14170 | 7.1% |
| Value | Count | Frequency (%) |
| 0 | 178381 | |
| 1 | 472 | 0.2% |
| 2 | 193 | 0.1% |
| 3 | 129 | 0.1% |
| 4 | 75 | < 0.1% |
| 5 | 179 | 0.1% |
| 6 | 100 | 0.1% |
| 7 | 93 | < 0.1% |
| 8 | 94 | < 0.1% |
| 9 | 56 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999 | 25 | |
| 95095 | 1 | < 0.1% |
| 75000 | 5 | < 0.1% |
| 70000 | 3 | < 0.1% |
| 66621 | 2 | < 0.1% |
| 60000 | 7 | < 0.1% |
| 57678 | 1 | < 0.1% |
| 55000 | 1 | < 0.1% |
| 54600 | 2 | < 0.1% |
| 54500 | 2 | < 0.1% |
liability
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Nonfiler | |
|---|---|
| Joint both under 65 | |
| Single | |
| Joint both 65+ | |
| Head of household | 7426 |
Length
| Max length | 29 |
|---|---|
| Median length | 20 |
| Mean length | 13.312993 |
| Min length | 7 |
Characters and Unicode
| Total characters | 2656235 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Head of household |
|---|---|
| 2nd row | Nonfiler |
| 3rd row | Nonfiler |
| 4th row | Nonfiler |
| 5th row | Joint both under 65 |
Common Values
| Value | Count | Frequency (%) |
| Nonfiler | 75093 | |
| Joint both under 65 | 67383 | |
| Single | 37421 | |
| Joint both 65+ | 8332 | 4.2% |
| Head of household | 7426 | 3.7% |
| Joint one under 65 & one 65+ | 3867 | 1.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 65 | 83449 | |
| joint | 79582 | |
| both | 75715 | |
| nonfiler | 75093 | |
| under | 71250 | |
| single | 37421 | |
| one | 7734 | 1.7% |
| head | 7426 | 1.6% |
| of | 7426 | 1.6% |
| household | 7426 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 456389 | ||
| n | 271080 | |
| o | 260402 | 9.8% |
| e | 206350 | 7.8% |
| i | 192096 | 7.2% |
| t | 155297 | 5.8% |
| r | 146343 | 5.5% |
| l | 119940 | 4.5% |
| h | 90567 | 3.4% |
| d | 86102 | 3.2% |
| Other values (14) | 671669 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1817360 | |
| Space Separator | 456389 | 17.2% |
| Uppercase Letter | 199522 | 7.5% |
| Decimal Number | 166898 | 6.3% |
| Math Symbol | 12199 | 0.5% |
| Other Punctuation | 3867 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 271080 | |
| o | 260402 | |
| e | 206350 | |
| i | 192096 | |
| t | 155297 | |
| r | 146343 | |
| l | 119940 | |
| h | 90567 | 5.0% |
| d | 86102 | 4.7% |
| f | 82519 | 4.5% |
| Other values (5) | 206664 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 79582 | |
| N | 75093 | |
| S | 37421 | |
| H | 7426 | 3.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 83449 | |
| 5 | 83449 |
Space Separator
| Value | Count | Frequency (%) |
| 456389 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 12199 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 3867 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2016882 | |
| Common | 639353 | 24.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 271080 | |
| o | 260402 | |
| e | 206350 | |
| i | 192096 | |
| t | 155297 | 7.7% |
| r | 146343 | 7.3% |
| l | 119940 | 5.9% |
| h | 90567 | 4.5% |
| d | 86102 | 4.3% |
| f | 82519 | 4.1% |
| Other values (9) | 406186 |
Common
| Value | Count | Frequency (%) |
| 456389 | ||
| 6 | 83449 | 13.1% |
| 5 | 83449 | 13.1% |
| + | 12199 | 1.9% |
| & | 3867 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2656235 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 456389 | ||
| n | 271080 | |
| o | 260402 | 9.8% |
| e | 206350 | 7.8% |
| i | 192096 | 7.2% |
| t | 155297 | 5.8% |
| r | 146343 | 5.5% |
| l | 119940 | 4.5% |
| h | 90567 | 3.4% |
| d | 86102 | 3.2% |
| Other values (14) | 671669 |
state_residence
Text
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 16 |
| Mean length | 15.456872 |
| Min length | 2 |
Characters and Unicode
| Total characters | 3083986 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Arkansas |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
| Value | Count | Frequency (%) |
| not | 183749 | |
| universe | 183749 | |
| in | 183749 | |
| california | 1714 | 0.3% |
| north | 1311 | 0.2% |
| utah | 1063 | 0.2% |
| new | 975 | 0.2% |
| carolina | 907 | 0.2% |
| florida | 849 | 0.1% |
| 708 | 0.1% | |
| Other values (46) | 11228 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 570002 | ||
| i | 380322 | |
| n | 377216 | |
| e | 373182 | |
| o | 195444 | 6.3% |
| r | 192089 | 6.2% |
| s | 189329 | 6.1% |
| t | 189229 | 6.1% |
| N | 186387 | 6.0% |
| u | 184977 | 6.0% |
| Other values (36) | 245809 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2311596 | |
| Space Separator | 570002 | 18.5% |
| Uppercase Letter | 201680 | 6.5% |
| Other Punctuation | 708 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 380322 | |
| n | 377216 | |
| e | 373182 | |
| o | 195444 | |
| r | 192089 | |
| s | 189329 | |
| t | 189229 | |
| u | 184977 | |
| v | 184122 | |
| a | 19048 | 0.8% |
| Other values (14) | 26638 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 186387 | |
| C | 3093 | 1.5% |
| M | 2539 | 1.3% |
| A | 1625 | 0.8% |
| O | 1073 | 0.5% |
| U | 1063 | 0.5% |
| I | 933 | 0.5% |
| F | 849 | 0.4% |
| D | 826 | 0.4% |
| W | 577 | 0.3% |
| Other values (10) | 2715 | 1.3% |
Space Separator
| Value | Count | Frequency (%) |
| 570002 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 708 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2513276 | |
| Common | 570710 | 18.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 380322 | |
| n | 377216 | |
| e | 373182 | |
| o | 195444 | |
| r | 192089 | |
| s | 189329 | |
| t | 189229 | |
| N | 186387 | |
| u | 184977 | |
| v | 184122 | |
| Other values (34) | 60979 | 2.4% |
Common
| Value | Count | Frequency (%) |
| 570002 | ||
| ? | 708 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3083986 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 570002 | ||
| i | 380322 | |
| n | 377216 | |
| e | 373182 | |
| o | 195444 | 6.3% |
| r | 192089 | 6.2% |
| s | 189329 | 6.1% |
| t | 189229 | 6.1% |
| N | 186387 | 6.0% |
| u | 184977 | 6.0% |
| Other values (36) | 245809 |
household_summary
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Householder | |
|---|---|
| Child under 18 never married | |
| Spouse of householder | |
| Child 18 or older | |
| Other relative of householder | |
| Other values (3) |
Length
| Max length | 37 |
|---|---|
| Median length | 30 |
| Mean length | 20.287883 |
| Min length | 12 |
Characters and Unicode
| Total characters | 4047879 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Householder |
|---|---|
| 2nd row | Child 18 or older |
| 3rd row | Child under 18 never married |
| 4th row | Child under 18 never married |
| 5th row | Spouse of householder |
Common Values
| Value | Count | Frequency (%) |
| Householder | 75475 | |
| Child under 18 never married | 50426 | |
| Spouse of householder | 41709 | |
| Child 18 or older | 14430 | 7.2% |
| Other relative of householder | 9702 | 4.9% |
| Nonrelative of householder | 7601 | 3.8% |
| Group Quarters- Secondary individual | 132 | 0.1% |
| Child under 18 ever married | 47 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| householder | 134487 | |
| child | 64903 | |
| 18 | 64903 | |
| of | 59012 | |
| under | 50473 | 8.8% |
| married | 50473 | 8.8% |
| never | 50426 | 8.8% |
| spouse | 41709 | 7.3% |
| older | 14430 | 2.5% |
| or | 14430 | 2.5% |
| Other values (8) | 27580 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 572826 | ||
| e | 571577 | |
| o | 406420 | |
| r | 392772 | |
| d | 315162 | |
| h | 268104 | 6.6% |
| l | 231255 | 5.7% |
| u | 227065 | 5.6% |
| s | 176328 | 4.4% |
| i | 133075 | 3.3% |
| Other values (19) | 753295 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3145329 | |
| Space Separator | 572826 | 14.2% |
| Uppercase Letter | 199786 | 4.9% |
| Decimal Number | 129806 | 3.2% |
| Dash Punctuation | 132 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 571577 | |
| o | 406420 | |
| r | 392772 | |
| d | 315162 | |
| h | 268104 | |
| l | 231255 | |
| u | 227065 | 7.2% |
| s | 176328 | 5.6% |
| i | 133075 | 4.2% |
| n | 108764 | 3.5% |
| Other values (8) | 314807 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 75475 | |
| C | 64903 | |
| S | 41841 | |
| O | 9702 | 4.9% |
| N | 7601 | 3.8% |
| G | 132 | 0.1% |
| Q | 132 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 64903 | |
| 1 | 64903 |
Space Separator
| Value | Count | Frequency (%) |
| 572826 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 132 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3345115 | |
| Common | 702764 | 17.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 571577 | |
| o | 406420 | |
| r | 392772 | |
| d | 315162 | |
| h | 268104 | |
| l | 231255 | |
| u | 227065 | 6.8% |
| s | 176328 | 5.3% |
| i | 133075 | 4.0% |
| n | 108764 | 3.3% |
| Other values (15) | 514593 |
Common
| Value | Count | Frequency (%) |
| 572826 | ||
| 8 | 64903 | 9.2% |
| 1 | 64903 | 9.2% |
| - | 132 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4047879 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 572826 | ||
| e | 571577 | |
| o | 406420 | |
| r | 392772 | |
| d | 315162 | |
| h | 268104 | 6.6% |
| l | 231255 | 5.7% |
| u | 227065 | 5.6% |
| s | 176328 | 4.4% |
| i | 133075 | 3.3% |
| Other values (19) | 753295 |
instance_weight
Real number (ℝ)
| Distinct | 99800 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1740.3805 |
| Minimum | 37.87 |
|---|---|
| Maximum | 18656.3 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 37.87 |
|---|---|
| 5-th percentile | 395.341 |
| Q1 | 1061.6075 |
| median | 1618.31 |
| Q3 | 2188.61 |
| 95-th percentile | 3585.9095 |
| Maximum | 18656.3 |
| Range | 18618.43 |
| Interquartile range (IQR) | 1127.0025 |
Descriptive statistics
| Standard deviation | 993.77064 |
|---|---|
| Coefficient of variation (CV) | 0.5710077 |
| Kurtosis | 5.4124708 |
| Mean | 1740.3805 |
| Median Absolute Deviation (MAD) | 561.465 |
| Skewness | 1.432729 |
| Sum | 3.4724419 × 108 |
| Variance | 987580.09 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1191.21 | 32 | < 0.1% |
| 753.23 | 32 | < 0.1% |
| 1787.34 | 32 | < 0.1% |
| 1601.4 | 32 | < 0.1% |
| 1317.51 | 31 | < 0.1% |
| 707.9 | 31 | < 0.1% |
| 1070.15 | 30 | < 0.1% |
| 1002.02 | 28 | < 0.1% |
| 1839.19 | 28 | < 0.1% |
| 1033.83 | 28 | < 0.1% |
| Other values (99790) | 199218 |
| Value | Count | Frequency (%) |
| 37.87 | 1 | < 0.1% |
| 39.11 | 1 | < 0.1% |
| 40.67 | 2 | < 0.1% |
| 42.82 | 2 | < 0.1% |
| 43.26 | 3 | |
| 45.74 | 2 | < 0.1% |
| 47.83 | 6 | |
| 49.82 | 2 | < 0.1% |
| 52.43 | 1 | < 0.1% |
| 52.46 | 4 |
| Value | Count | Frequency (%) |
| 18656.3 | 1 | |
| 16349.2 | 1 | |
| 13911.5 | 1 | |
| 13145.1 | 1 | |
| 13114.2 | 1 | |
| 12960.2 | 1 | |
| 12399.9 | 1 | |
| 12184.5 | 1 | |
| 11958.4 | 1 | |
| 11863 | 1 |
migration_msa
Categorical
IMBALANCE 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| ? | |
|---|---|
| Nonmover | |
| MSA to MSA | |
| NonMSA to nonMSA | 2811 |
| Not in universe | 1516 |
| Other values (5) | 2361 |
Length
| Max length | 17 |
|---|---|
| Median length | 16 |
| Mean length | 5.8412055 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1165449 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MSA to MSA |
|---|---|
| 2nd row | ? |
| 3rd row | Nonmover |
| 4th row | Nonmover |
| 5th row | ? |
Common Values
| Value | Count | Frequency (%) |
| ? | 99695 | |
| Nonmover | 82538 | |
| MSA to MSA | 10601 | 5.3% |
| NonMSA to nonMSA | 2811 | 1.4% |
| Not in universe | 1516 | 0.8% |
| MSA to nonMSA | 790 | 0.4% |
| NonMSA to MSA | 615 | 0.3% |
| Abroad to MSA | 453 | 0.2% |
| Not identifiable | 430 | 0.2% |
| Abroad to nonMSA | 73 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 99695 | ||
| nonmover | 82538 | |
| msa | 23060 | 9.9% |
| to | 15343 | 6.6% |
| nonmsa | 7100 | 3.0% |
| not | 1946 | 0.8% |
| in | 1516 | 0.6% |
| universe | 1516 | 0.6% |
| abroad | 526 | 0.2% |
| identifiable | 430 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 233670 | ||
| o | 189991 | |
| ? | 99695 | |
| n | 96774 | |
| N | 87910 | 7.5% |
| e | 86430 | 7.4% |
| r | 84580 | 7.3% |
| v | 84054 | 7.2% |
| m | 82538 | 7.1% |
| A | 30686 | 2.6% |
| Other values (11) | 89121 | 7.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 653168 | |
| Space Separator | 233670 | 20.0% |
| Uppercase Letter | 178916 | 15.4% |
| Other Punctuation | 99695 | 8.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 189991 | |
| n | 96774 | |
| e | 86430 | |
| r | 84580 | |
| v | 84054 | |
| m | 82538 | |
| t | 17719 | 2.7% |
| i | 4322 | 0.7% |
| u | 1516 | 0.2% |
| s | 1516 | 0.2% |
| Other values (5) | 3728 | 0.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 87910 | |
| A | 30686 | 17.2% |
| S | 30160 | 16.9% |
| M | 30160 | 16.9% |
Space Separator
| Value | Count | Frequency (%) |
| 233670 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 99695 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 832084 | |
| Common | 333365 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 189991 | |
| n | 96774 | |
| N | 87910 | |
| e | 86430 | |
| r | 84580 | |
| v | 84054 | |
| m | 82538 | |
| A | 30686 | 3.7% |
| S | 30160 | 3.6% |
| M | 30160 | 3.6% |
| Other values (9) | 28801 | 3.5% |
Common
| Value | Count | Frequency (%) |
| 233670 | ||
| ? | 99695 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1165449 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 233670 | ||
| o | 189991 | |
| ? | 99695 | |
| n | 96774 | |
| N | 87910 | 7.5% |
| e | 86430 | 7.4% |
| r | 84580 | 7.3% |
| v | 84054 | 7.2% |
| m | 82538 | 7.1% |
| A | 30686 | 2.6% |
| Other values (11) | 89121 | 7.6% |
migration_reg
Categorical
IMBALANCE 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| ? | |
|---|---|
| Nonmover | |
| Same county | 9812 |
| Different county same state | 2797 |
| Not in universe | 1516 |
| Other values (4) | 3164 |
Length
| Max length | 31 |
|---|---|
| Median length | 30 |
| Mean length | 6.1668839 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1230429 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Same county |
|---|---|
| 2nd row | ? |
| 3rd row | Nonmover |
| 4th row | Nonmover |
| 5th row | ? |
Common Values
| Value | Count | Frequency (%) |
| ? | 99695 | |
| Nonmover | 82538 | |
| Same county | 9812 | 4.9% |
| Different county same state | 2797 | 1.4% |
| Not in universe | 1516 | 0.8% |
| Different region | 1178 | 0.6% |
| Different state same division | 991 | 0.5% |
| Abroad | 530 | 0.3% |
| Different division same region | 465 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 99695 | ||
| nonmover | 82538 | |
| same | 14065 | 6.2% |
| county | 12609 | 5.6% |
| different | 5431 | 2.4% |
| state | 3788 | 1.7% |
| region | 1643 | 0.7% |
| not | 1516 | 0.7% |
| in | 1516 | 0.7% |
| universe | 1516 | 0.7% |
| Other values (2) | 1986 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 226303 | ||
| o | 182830 | |
| e | 115928 | |
| n | 106709 | |
| ? | 99695 | |
| m | 96603 | |
| r | 91658 | |
| v | 85510 | 6.9% |
| N | 84054 | 6.8% |
| t | 27132 | 2.2% |
| Other values (13) | 114007 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 804604 | |
| Space Separator | 226303 | 18.4% |
| Uppercase Letter | 99827 | 8.1% |
| Other Punctuation | 99695 | 8.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 182830 | |
| e | 115928 | |
| n | 106709 | |
| m | 96603 | |
| r | 91658 | |
| v | 85510 | |
| t | 27132 | 3.4% |
| a | 18383 | 2.3% |
| i | 14474 | 1.8% |
| u | 14125 | 1.8% |
| Other values (7) | 51252 | 6.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 84054 | |
| S | 9812 | 9.8% |
| D | 5431 | 5.4% |
| A | 530 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 226303 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 99695 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 904431 | |
| Common | 325998 | 26.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 182830 | |
| e | 115928 | |
| n | 106709 | |
| m | 96603 | |
| r | 91658 | |
| v | 85510 | |
| N | 84054 | |
| t | 27132 | 3.0% |
| a | 18383 | 2.0% |
| i | 14474 | 1.6% |
| Other values (11) | 81150 |
Common
| Value | Count | Frequency (%) |
| 226303 | ||
| ? | 99695 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1230429 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 226303 | ||
| o | 182830 | |
| e | 115928 | |
| n | 106709 | |
| ? | 99695 | |
| m | 96603 | |
| r | 91658 | |
| v | 85510 | 6.9% |
| N | 84054 | 6.8% |
| t | 27132 | 2.2% |
| Other values (13) | 114007 |
migration_within
Categorical
IMBALANCE 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| ? | |
|---|---|
| Nonmover | |
| Same county | 9812 |
| Different county same state | 2797 |
| Not in universe | 1516 |
| Other values (5) | 3164 |
Length
| Max length | 29 |
|---|---|
| Median length | 28 |
| Mean length | 6.1860597 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1234255 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Same county |
|---|---|
| 2nd row | ? |
| 3rd row | Nonmover |
| 4th row | Nonmover |
| 5th row | ? |
Common Values
| Value | Count | Frequency (%) |
| ? | 99695 | |
| Nonmover | 82538 | |
| Same county | 9812 | 4.9% |
| Different county same state | 2797 | 1.4% |
| Not in universe | 1516 | 0.8% |
| Different state in South | 973 | 0.5% |
| Different state in West | 679 | 0.3% |
| Different state in Midwest | 551 | 0.3% |
| Abroad | 530 | 0.3% |
| Different state in Northeast | 431 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 99695 | ||
| nonmover | 82538 | |
| same | 12609 | 5.5% |
| county | 12609 | 5.5% |
| different | 5431 | 2.4% |
| state | 5431 | 2.4% |
| in | 4150 | 1.8% |
| not | 1516 | 0.7% |
| universe | 1516 | 0.7% |
| south | 973 | 0.4% |
| Other values (4) | 2191 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 228659 | ||
| o | 181135 | |
| e | 116133 | |
| n | 106244 | |
| ? | 99695 | |
| m | 95147 | |
| r | 90446 | 7.3% |
| N | 84485 | 6.8% |
| v | 84054 | 6.8% |
| t | 33483 | 2.7% |
| Other values (16) | 114774 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 803440 | |
| Space Separator | 228659 | 18.5% |
| Uppercase Letter | 102461 | 8.3% |
| Other Punctuation | 99695 | 8.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 181135 | |
| e | 116133 | |
| n | 106244 | |
| m | 95147 | |
| r | 90446 | |
| v | 84054 | |
| t | 33483 | 4.2% |
| a | 19001 | 2.4% |
| u | 15098 | 1.9% |
| c | 12609 | 1.6% |
| Other values (8) | 50090 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 84485 | |
| S | 10785 | 10.5% |
| D | 5431 | 5.3% |
| W | 679 | 0.7% |
| M | 551 | 0.5% |
| A | 530 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 228659 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 99695 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 905901 | |
| Common | 328354 | 26.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 181135 | |
| e | 116133 | |
| n | 106244 | |
| m | 95147 | |
| r | 90446 | |
| N | 84485 | |
| v | 84054 | |
| t | 33483 | 3.7% |
| a | 19001 | 2.1% |
| u | 15098 | 1.7% |
| Other values (14) | 80675 |
Common
| Value | Count | Frequency (%) |
| 228659 | ||
| ? | 99695 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1234255 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 228659 | ||
| o | 181135 | |
| e | 116133 | |
| n | 106244 | |
| ? | 99695 | |
| m | 95147 | |
| r | 90446 | 7.3% |
| N | 84485 | 6.8% |
| v | 84054 | 6.8% |
| t | 33483 | 2.7% |
| Other values (16) | 114774 |
live_one_year
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe under 1 year old | |
|---|---|
| Yes | |
| No |
Length
| Max length | 33 |
|---|---|
| Median length | 33 |
| Mean length | 18.6317 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3717434 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No |
|---|---|
| 2nd row | Not in universe under 1 year old |
| 3rd row | Yes |
| 4th row | Yes |
| 5th row | Not in universe under 1 year old |
Common Values
| Value | Count | Frequency (%) |
| Not in universe under 1 year old | 101211 | |
| Yes | 82538 | |
| No | 15773 | 7.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 101211 | |
| in | 101211 | |
| universe | 101211 | |
| under | 101211 | |
| 1 | 101211 | |
| year | 101211 | |
| old | 101211 | |
| yes | 82538 | |
| no | 15773 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 806788 | ||
| e | 487382 | |
| n | 303633 | 8.2% |
| r | 303633 | 8.2% |
| o | 218195 | 5.9% |
| i | 202422 | 5.4% |
| u | 202422 | 5.4% |
| d | 202422 | 5.4% |
| s | 183749 | 4.9% |
| N | 116984 | 3.1% |
| Other values (7) | 689804 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2609913 | |
| Space Separator | 806788 | 21.7% |
| Uppercase Letter | 199522 | 5.4% |
| Decimal Number | 101211 | 2.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 487382 | |
| n | 303633 | |
| r | 303633 | |
| o | 218195 | |
| i | 202422 | |
| u | 202422 | |
| d | 202422 | |
| s | 183749 | 7.0% |
| t | 101211 | 3.9% |
| v | 101211 | 3.9% |
| Other values (3) | 303633 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 116984 | |
| Y | 82538 |
Space Separator
| Value | Count | Frequency (%) |
| 806788 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 101211 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2809435 | |
| Common | 907999 | 24.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 487382 | |
| n | 303633 | |
| r | 303633 | |
| o | 218195 | |
| i | 202422 | |
| u | 202422 | |
| d | 202422 | |
| s | 183749 | 6.5% |
| N | 116984 | 4.2% |
| t | 101211 | 3.6% |
| Other values (5) | 487382 |
Common
| Value | Count | Frequency (%) |
| 806788 | ||
| 1 | 101211 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3717434 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 806788 | ||
| e | 487382 | |
| n | 303633 | 8.2% |
| r | 303633 | 8.2% |
| o | 218195 | 5.9% |
| i | 202422 | 5.4% |
| u | 202422 | 5.4% |
| d | 202422 | 5.4% |
| s | 183749 | 4.9% |
| N | 116984 | 3.1% |
| Other values (7) | 689804 |
sunbelt
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| ? | |
|---|---|
| Not in universe | |
| No | |
| Yes | 5786 |
Length
| Max length | 16 |
|---|---|
| Median length | 4 |
| Mean length | 8.0059292 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1597359 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Yes |
|---|---|
| 2nd row | ? |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | ? |
Common Values
| Value | Count | Frequency (%) |
| ? | 99695 | |
| Not in universe | 84054 | |
| No | 9987 | 5.0% |
| Yes | 5786 | 2.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 99695 | ||
| not | 84054 | |
| in | 84054 | |
| universe | 84054 | |
| no | 9987 | 2.7% |
| yes | 5786 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 367630 | ||
| e | 173894 | |
| i | 168108 | |
| n | 168108 | |
| ? | 99695 | 6.2% |
| N | 94041 | 5.9% |
| o | 94041 | 5.9% |
| s | 89840 | 5.6% |
| t | 84054 | 5.3% |
| u | 84054 | 5.3% |
| Other values (3) | 173894 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1030207 | |
| Space Separator | 367630 | 23.0% |
| Uppercase Letter | 99827 | 6.2% |
| Other Punctuation | 99695 | 6.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 173894 | |
| i | 168108 | |
| n | 168108 | |
| o | 94041 | |
| s | 89840 | |
| t | 84054 | |
| u | 84054 | |
| v | 84054 | |
| r | 84054 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 94041 | |
| Y | 5786 | 5.8% |
Space Separator
| Value | Count | Frequency (%) |
| 367630 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 99695 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1130034 | |
| Common | 467325 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 173894 | |
| i | 168108 | |
| n | 168108 | |
| N | 94041 | |
| o | 94041 | |
| s | 89840 | |
| t | 84054 | |
| u | 84054 | |
| v | 84054 | |
| r | 84054 |
Common
| Value | Count | Frequency (%) |
| 367630 | ||
| ? | 99695 | 21.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1597359 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 367630 | ||
| e | 173894 | |
| i | 168108 | |
| n | 168108 | |
| ? | 99695 | 6.2% |
| N | 94041 | 5.9% |
| o | 94041 | 5.9% |
| s | 89840 | 5.6% |
| t | 84054 | 5.3% |
| u | 84054 | 5.3% |
| Other values (3) | 173894 |
person_worked
Real number (ℝ)
ZEROS 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.9561903 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 95982 |
| Zeros (%) | 48.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.3651274 |
|---|---|
| Coefficient of variation (CV) | 1.2090477 |
| Kurtosis | -1.0822581 |
| Mean | 1.9561903 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.75155306 |
| Sum | 390303 |
| Variance | 5.5938275 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 95982 | |
| 6 | 36511 | 18.3% |
| 1 | 23109 | 11.6% |
| 4 | 14379 | 7.2% |
| 3 | 13425 | 6.7% |
| 2 | 10081 | 5.1% |
| 5 | 6035 | 3.0% |
| Value | Count | Frequency (%) |
| 0 | 95982 | |
| 1 | 23109 | 11.6% |
| 2 | 10081 | 5.1% |
| 3 | 13425 | 6.7% |
| 4 | 14379 | 7.2% |
| 5 | 6035 | 3.0% |
| 6 | 36511 | 18.3% |
| Value | Count | Frequency (%) |
| 6 | 36511 | 18.3% |
| 5 | 6035 | 3.0% |
| 4 | 14379 | 7.2% |
| 3 | 13425 | 6.7% |
| 2 | 10081 | 5.1% |
| 1 | 23109 | 11.6% |
| 0 | 95982 |
under18
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| Both parents present | |
| Mother only present | 12772 |
| Father only present | 1883 |
| Neither parent present | 1653 |
Length
| Max length | 23 |
|---|---|
| Median length | 16 |
| Mean length | 17.328706 |
| Min length | 16 |
Characters and Unicode
| Total characters | 3457458 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Both parents present |
| 4th row | Both parents present |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 144231 | |
| Both parents present | 38983 | 19.5% |
| Mother only present | 12772 | 6.4% |
| Father only present | 1883 | 0.9% |
| Neither parent present | 1653 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 144231 | |
| in | 144231 | |
| universe | 144231 | |
| present | 55291 | 9.2% |
| both | 38983 | 6.5% |
| parents | 38983 | 6.5% |
| only | 14655 | 2.4% |
| mother | 12772 | 2.1% |
| father | 1883 | 0.3% |
| neither | 1653 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 598566 | ||
| e | 457641 | |
| n | 399044 | |
| t | 295449 | |
| i | 290115 | |
| r | 256466 | |
| s | 238505 | 6.9% |
| o | 210641 | 6.1% |
| N | 145884 | 4.2% |
| u | 144231 | 4.2% |
| Other values (9) | 420916 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2659370 | |
| Space Separator | 598566 | 17.3% |
| Uppercase Letter | 199522 | 5.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 457641 | |
| n | 399044 | |
| t | 295449 | |
| i | 290115 | |
| r | 256466 | |
| s | 238505 | |
| o | 210641 | |
| u | 144231 | 5.4% |
| v | 144231 | 5.4% |
| p | 95927 | 3.6% |
| Other values (4) | 127120 | 4.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 145884 | |
| B | 38983 | 19.5% |
| M | 12772 | 6.4% |
| F | 1883 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| 598566 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2858892 | |
| Common | 598566 | 17.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 457641 | |
| n | 399044 | |
| t | 295449 | |
| i | 290115 | |
| r | 256466 | |
| s | 238505 | |
| o | 210641 | |
| N | 145884 | 5.1% |
| u | 144231 | 5.0% |
| v | 144231 | 5.0% |
| Other values (8) | 276685 |
Common
| Value | Count | Frequency (%) |
| 598566 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3457458 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 598566 | ||
| e | 457641 | |
| n | 399044 | |
| t | 295449 | |
| i | 290115 | |
| r | 256466 | |
| s | 238505 | 6.9% |
| o | 210641 | 6.1% |
| N | 145884 | 4.2% |
| u | 144231 | 4.2% |
| Other values (9) | 420916 |
citizen
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Native- Born in the United States | |
|---|---|
| Foreign born- Not a citizen of U S | 13401 |
| Foreign born- U S citizen by naturalization | 5855 |
| Native- Born abroad of American Parent(s) | 1756 |
| Native- Born in Puerto Rico or U S Outlying | 1519 |
Length
| Max length | 44 |
|---|---|
| Median length | 34 |
| Mean length | 34.574323 |
| Min length | 34 |
Characters and Unicode
| Total characters | 6898338 |
|---|---|
| Distinct characters | 33 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Native- Born in the United States |
|---|---|
| 2nd row | Foreign born- Not a citizen of U S |
| 3rd row | Native- Born in the United States |
| 4th row | Native- Born in the United States |
| 5th row | Native- Born in the United States |
Common Values
| Value | Count | Frequency (%) |
| Native- Born in the United States | 176991 | |
| Foreign born- Not a citizen of U S | 13401 | 6.7% |
| Foreign born- U S citizen by naturalization | 5855 | 2.9% |
| Native- Born abroad of American Parent(s) | 1756 | 0.9% |
| Native- Born in Puerto Rico or U S Outlying | 1519 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| born | 199522 | |
| native | 180266 | |
| in | 178510 | |
| the | 176991 | |
| united | 176991 | |
| states | 176991 | |
| s | 20775 | 1.7% |
| u | 20775 | 1.7% |
| citizen | 19256 | 1.6% |
| foreign | 19256 | 1.6% |
| Other values (12) | 65013 | 5.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1247747 | ||
| t | 937391 | |
| e | 754782 | |
| n | 610276 | |
| i | 610039 | |
| a | 395247 | 5.7% |
| o | 259504 | 3.8% |
| r | 232939 | 3.4% |
| - | 199522 | 2.9% |
| S | 197766 | 2.9% |
| Other values (23) | 1453125 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4650767 | |
| Space Separator | 1247747 | 18.1% |
| Uppercase Letter | 796790 | 11.6% |
| Dash Punctuation | 199522 | 2.9% |
| Open Punctuation | 1756 | < 0.1% |
| Close Punctuation | 1756 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 937391 | |
| e | 754782 | |
| n | 610276 | |
| i | 610039 | |
| a | 395247 | |
| o | 259504 | 5.6% |
| r | 232939 | 5.0% |
| v | 180266 | 3.9% |
| s | 178747 | 3.8% |
| d | 178747 | 3.8% |
| Other values (10) | 312829 | 6.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 197766 | |
| U | 197766 | |
| N | 193667 | |
| B | 180266 | |
| F | 19256 | 2.4% |
| P | 3275 | 0.4% |
| A | 1756 | 0.2% |
| R | 1519 | 0.2% |
| O | 1519 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 1247747 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 199522 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1756 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1756 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5447557 | |
| Common | 1450781 | 21.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 937391 | |
| e | 754782 | |
| n | 610276 | |
| i | 610039 | |
| a | 395247 | 7.3% |
| o | 259504 | 4.8% |
| r | 232939 | 4.3% |
| S | 197766 | 3.6% |
| U | 197766 | 3.6% |
| N | 193667 | 3.6% |
| Other values (19) | 1058180 |
Common
| Value | Count | Frequency (%) |
| 1247747 | ||
| - | 199522 | 13.8% |
| ( | 1756 | 0.1% |
| ) | 1756 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6898338 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1247747 | ||
| t | 937391 | |
| e | 754782 | |
| n | 610276 | |
| i | 610039 | |
| a | 395247 | 5.7% |
| o | 259504 | 3.8% |
| r | 232939 | 3.4% |
| - | 199522 | 2.9% |
| S | 197766 | 2.9% |
| Other values (23) | 1453125 |
person_income
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 0 | |
|---|---|
| 2 | 16153 |
| 1 | 2698 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 199522 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 180671 | |
| 2 | 16153 | 8.1% |
| 1 | 2698 | 1.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 180671 | |
| 2 | 16153 | 8.1% |
| 1 | 2698 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 180671 | |
| 2 | 16153 | 8.1% |
| 1 | 2698 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 199522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 180671 | |
| 2 | 16153 | 8.1% |
| 1 | 2698 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 199522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 180671 | |
| 2 | 16153 | 8.1% |
| 1 | 2698 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 199522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 180671 | |
| 2 | 16153 | 8.1% |
| 1 | 2698 | 1.4% |
own_bus
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Not in universe | |
|---|---|
| No | 1593 |
| Yes | 391 |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 15.872691 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3166951 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not in universe |
|---|---|
| 2nd row | Not in universe |
| 3rd row | Not in universe |
| 4th row | Not in universe |
| 5th row | Not in universe |
Common Values
| Value | Count | Frequency (%) |
| Not in universe | 197538 | |
| No | 1593 | 0.8% |
| Yes | 391 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 197538 | |
| in | 197538 | |
| universe | 197538 | |
| no | 1593 | 0.3% |
| yes | 391 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 594598 | ||
| e | 395467 | |
| i | 395076 | |
| n | 395076 | |
| N | 199131 | 6.3% |
| o | 199131 | 6.3% |
| s | 197929 | 6.2% |
| t | 197538 | 6.2% |
| u | 197538 | 6.2% |
| v | 197538 | 6.2% |
| Other values (2) | 197929 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2372831 | |
| Space Separator | 594598 | 18.8% |
| Uppercase Letter | 199522 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 395467 | |
| i | 395076 | |
| n | 395076 | |
| o | 199131 | |
| s | 197929 | |
| t | 197538 | |
| u | 197538 | |
| v | 197538 | |
| r | 197538 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 199131 | |
| Y | 391 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 594598 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2572353 | |
| Common | 594598 | 18.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 395467 | |
| i | 395076 | |
| n | 395076 | |
| N | 199131 | |
| o | 199131 | |
| s | 197929 | |
| t | 197538 | |
| u | 197538 | |
| v | 197538 | |
| r | 197538 |
Common
| Value | Count | Frequency (%) |
| 594598 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3166951 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 594598 | ||
| e | 395467 | |
| i | 395076 | |
| n | 395076 | |
| N | 199131 | 6.3% |
| o | 199131 | 6.3% |
| s | 197929 | 6.2% |
| t | 197538 | 6.2% |
| u | 197538 | 6.2% |
| v | 197538 | 6.2% |
| Other values (2) | 197929 | 6.2% |
week_workd
Real number (ℝ)
ZEROS 
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.175013 |
| Minimum | 0 |
|---|---|
| Maximum | 52 |
| Zeros | 95982 |
| Zeros (%) | 48.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 8 |
| Q3 | 52 |
| 95-th percentile | 52 |
| Maximum | 52 |
| Range | 52 |
| Interquartile range (IQR) | 52 |
Descriptive statistics
| Standard deviation | 24.411494 |
|---|---|
| Coefficient of variation (CV) | 1.0533541 |
| Kurtosis | -1.8638093 |
| Mean | 23.175013 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.21016025 |
| Sum | 4623925 |
| Variance | 595.92105 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 95982 | |
| 52 | 70314 | |
| 40 | 2790 | 1.4% |
| 50 | 2304 | 1.2% |
| 26 | 2268 | 1.1% |
| 48 | 1806 | 0.9% |
| 12 | 1780 | 0.9% |
| 30 | 1378 | 0.7% |
| 20 | 1330 | 0.7% |
| 8 | 1126 | 0.6% |
| Other values (43) | 18444 | 9.2% |
| Value | Count | Frequency (%) |
| 0 | 95982 | |
| 1 | 464 | 0.2% |
| 2 | 458 | 0.2% |
| 3 | 417 | 0.2% |
| 4 | 757 | 0.4% |
| 5 | 309 | 0.2% |
| 6 | 646 | 0.3% |
| 7 | 152 | 0.1% |
| 8 | 1126 | 0.6% |
| 9 | 239 | 0.1% |
| Value | Count | Frequency (%) |
| 52 | 70314 | |
| 51 | 819 | 0.4% |
| 50 | 2304 | 1.2% |
| 49 | 509 | 0.3% |
| 48 | 1806 | 0.9% |
| 47 | 278 | 0.1% |
| 46 | 708 | 0.4% |
| 45 | 669 | 0.3% |
| 44 | 845 | 0.4% |
| 43 | 374 | 0.2% |
income
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| 0 | |
|---|---|
| 1 | 12382 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 199522 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 187140 | |
| 1 | 12382 | 6.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 187140 | |
| 1 | 12382 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 187140 | |
| 1 | 12382 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 199522 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 187140 | |
| 1 | 12382 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 199522 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 187140 | |
| 1 | 12382 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 199522 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 187140 | |
| 1 | 12382 | 6.2% |
| age | class_of_worker | industry_code | occupation_code | education | wage_per_hour | edu_inst | marital | mace | hispanic | sex | labor_union | reason_unemployment | employment_type | gains | losses | divdends | liability | state_residence | household_summary | instance_weight | migration_msa | migration_reg | migration_within | live_one_year | sunbelt | person_worked | under18 | citizen | person_income | own_bus | week_workd | income | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 58 | Self-employed-not incorporated | 4 | 34 | Some college but no degree | 0 | Not in universe | Divorced | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Head of household | Arkansas | Householder | 1053.55 | MSA to MSA | Same county | Same county | No | Yes | 1 | Not in universe | Native- Born in the United States | 0 | Not in universe | 52 | 0 |
| 1 | 18 | Not in universe | 0 | 0 | 10th grade | 0 | High school | Never married | Asian or Pacific Islander | All other | Female | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Nonfiler | Not in universe | Child 18 or older | 991.95 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Not in universe | Foreign born- Not a citizen of U S | 0 | Not in universe | 0 | 0 |
| 2 | 9 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Child under 18 never married | 1758.14 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Both parents present | Native- Born in the United States | 0 | Not in universe | 0 | 0 |
| 3 | 10 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Child under 18 never married | 1069.16 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Both parents present | Native- Born in the United States | 0 | Not in universe | 0 | 0 |
| 4 | 48 | Private | 40 | 10 | Some college but no degree | 1200 | Not in universe | Married-civilian spouse present | Amer Indian Aleut or Eskimo | All other | Female | No | Not in universe | Full-time schedules | 0 | 0 | 0 | Joint both under 65 | Not in universe | Spouse of householder | 162.61 | ? | ? | ? | Not in universe under 1 year old | ? | 1 | Not in universe | Native- Born in the United States | 2 | Not in universe | 52 | 0 |
| 5 | 42 | Private | 34 | 3 | Bachelors degree(BA AB BS) | 0 | Not in universe | Married-civilian spouse present | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 5178 | 0 | 0 | Joint both under 65 | Not in universe | Householder | 1535.86 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 6 | Not in universe | Native- Born in the United States | 0 | Not in universe | 52 | 0 |
| 6 | 28 | Private | 4 | 40 | High school graduate | 0 | Not in universe | Never married | White | All other | Female | Not in universe | Job loser - on layoff | Unemployed full-time | 0 | 0 | 0 | Single | Not in universe | Nonrelative of householder | 898.83 | ? | ? | ? | Not in universe under 1 year old | ? | 4 | Not in universe | Native- Born in the United States | 0 | Not in universe | 30 | 0 |
| 7 | 47 | Local government | 43 | 26 | Some college but no degree | 876 | Not in universe | Married-civilian spouse present | White | All other | Female | No | Not in universe | Full-time schedules | 0 | 0 | 0 | Joint both under 65 | Not in universe | Spouse of householder | 1661.53 | ? | ? | ? | Not in universe under 1 year old | ? | 5 | Not in universe | Native- Born in the United States | 0 | Not in universe | 52 | 0 |
| 8 | 34 | Private | 4 | 37 | Some college but no degree | 0 | Not in universe | Married-civilian spouse present | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Joint both under 65 | Not in universe | Householder | 1146.79 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 6 | Not in universe | Native- Born in the United States | 0 | Not in universe | 52 | 0 |
| 9 | 8 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Child under 18 never married | 2466.24 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Both parents present | Native- Born in the United States | 0 | Not in universe | 0 | 0 |
| age | class_of_worker | industry_code | occupation_code | education | wage_per_hour | edu_inst | marital | mace | hispanic | sex | labor_union | reason_unemployment | employment_type | gains | losses | divdends | liability | state_residence | household_summary | instance_weight | migration_msa | migration_reg | migration_within | live_one_year | sunbelt | person_worked | under18 | citizen | person_income | own_bus | week_workd | income | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 199512 | 57 | Private | 9 | 37 | 9th grade | 0 | Not in universe | Divorced | White | Central or South American | Female | Not in universe | Not in universe | Full-time schedules | 0 | 0 | 0 | Single | Not in universe | Householder | 743.66 | ? | ? | ? | Not in universe under 1 year old | ? | 4 | Not in universe | Foreign born- Not a citizen of U S | 0 | Not in universe | 52 | 0 |
| 199513 | 51 | Private | 33 | 19 | 10th grade | 0 | Not in universe | Widowed | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Single | North Dakota | Householder | 1302.34 | NonMSA to nonMSA | Same county | Same county | No | Yes | 6 | Not in universe | Native- Born in the United States | 0 | Not in universe | 52 | 0 |
| 199514 | 87 | Not in universe | 0 | 0 | High school graduate | 0 | Not in universe | Widowed | White | All other | Female | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Single | Not in universe | Householder | 3255.80 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Not in universe | Native- Born in the United States | 0 | Not in universe | 0 | 0 |
| 199515 | 3 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | Black | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Utah | Nonrelative of householder | 2733.75 | MSA to MSA | Same county | Same county | No | Yes | 0 | Mother only present | Native- Born in the United States | 0 | Not in universe | 0 | 0 |
| 199516 | 39 | Private | 43 | 26 | Bachelors degree(BA AB BS) | 0 | Not in universe | Never married | Other | Mexican-American | Male | No | Not in universe | Full-time schedules | 6849 | 0 | 0 | Single | Not in universe | Householder | 908.14 | ? | ? | ? | Not in universe under 1 year old | ? | 6 | Not in universe | Foreign born- Not a citizen of U S | 2 | Not in universe | 52 | 0 |
| 199517 | 87 | Not in universe | 0 | 0 | 7th and 8th grade | 0 | Not in universe | Married-civilian spouse present | White | All other | Male | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Joint both 65+ | Not in universe | Householder | 955.27 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Not in universe | Native- Born in the United States | 0 | Not in universe | 0 | 0 |
| 199518 | 65 | Self-employed-incorporated | 37 | 2 | 11th grade | 0 | Not in universe | Married-civilian spouse present | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 6418 | 0 | 9 | Joint one under 65 & one 65+ | Not in universe | Householder | 687.19 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 1 | Not in universe | Native- Born in the United States | 0 | Not in universe | 52 | 0 |
| 199519 | 47 | Not in universe | 0 | 0 | Some college but no degree | 0 | Not in universe | Married-civilian spouse present | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 157 | Joint both under 65 | Not in universe | Householder | 1923.03 | ? | ? | ? | Not in universe under 1 year old | ? | 6 | Not in universe | Foreign born- U S citizen by naturalization | 0 | Not in universe | 52 | 0 |
| 199520 | 16 | Not in universe | 0 | 0 | 10th grade | 0 | High school | Never married | White | All other | Female | Not in universe | Not in universe | Not in labor force | 0 | 0 | 0 | Nonfiler | Not in universe | Child under 18 never married | 4664.87 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | Native- Born in the United States | 0 | Not in universe | 0 | 0 |
| 199521 | 32 | Private | 42 | 30 | High school graduate | 0 | Not in universe | Never married | Black | All other | Female | No | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Single | Not in universe | Householder | 1830.11 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 6 | Not in universe | Foreign born- Not a citizen of U S | 0 | Not in universe | 52 | 0 |
Most frequently occurring
| age | class_of_worker | industry_code | occupation_code | education | wage_per_hour | edu_inst | marital | mace | hispanic | sex | labor_union | reason_unemployment | employment_type | gains | losses | divdends | liability | state_residence | household_summary | instance_weight | migration_msa | migration_reg | migration_within | live_one_year | sunbelt | person_worked | under18 | citizen | person_income | own_bus | week_workd | income | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 118 | 0 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Child under 18 never married | 1363.88 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 6 |
| 120 | 0 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Child under 18 never married | 1366.71 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 6 |
| 654 | 3 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Child under 18 never married | 2125.99 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 6 |
| 2038 | 10 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Child under 18 never married | 1185.19 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 6 |
| 2243 | 11 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Child under 18 never married | 1131.62 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Both parents present | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 6 |
| 2708 | 13 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Child under 18 never married | 981.79 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Both parents present | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 6 |
| 2710 | 13 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Child under 18 never married | 1013.75 | Nonmover | Nonmover | Nonmover | Yes | Not in universe | 0 | Both parents present | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 6 |
| 304 | 1 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | White | All other | Male | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Child under 18 never married | 1175.86 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 5 |
| 408 | 2 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Child under 18 never married | 933.97 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 5 |
| 427 | 2 | Not in universe | 0 | 0 | Children | 0 | Not in universe | Never married | White | All other | Female | Not in universe | Not in universe | Children or Armed Forces | 0 | 0 | 0 | Nonfiler | Not in universe | Child under 18 never married | 1182.42 | ? | ? | ? | Not in universe under 1 year old | ? | 0 | Both parents present | Native- Born in the United States | 0 | Not in universe | 0 | 0 | 5 |